Summarizing Source Code using a Neural Attention Model
نویسندگان
چکیده
High quality source code is often paired with high level summaries of the computation it performs, for example in code documentation or in descriptions posted in online forums. Such summaries are extremely useful for applications such as code search but are expensive to manually author, hence only done for a small fraction of all code that is produced. In this paper, we present the first completely datadriven approach for generating high level summaries of source code. Our model, CODE-NN , uses Long Short Term Memory (LSTM) networks with attention to produce sentences that describe C# code snippets and SQL queries. CODE-NN is trained on a new corpus that is automatically collected from StackOverflow, which we release. Experiments demonstrate strong performance on two tasks: (1) code summarization, where we establish the first end-to-end learning results and outperform strong baselines, and (2) code retrieval, where our learned model improves the state of the art on a recently introduced C# benchmark by a large margin.
منابع مشابه
Summarizing Git Commits and GitHub Pull Requests Using Sequence to Sequence Neural Attention Models
Every day millions of developers and programmers push commits to GitHub to ensure their projects are version controlled, reproducible, and remotely accessible. There are nearly 20 million public repositories (collections of source code in the form of projects) on GitHub today, and over 16 million unique users. Users are able to commit additions or changes to their own repositories, as well as t...
متن کاملModeling and Optimization of Anethole Ultrasound-Assisted Extraction from Fennel Seeds using Artificial Neural Network
Extraction of essential oils from medicinal plants has received researcher’s attention as it has a wide variety of applications in different industries. In this study, ultrasonic method has been used to facilitate the extraction of active ingredient anethole from fennel seeds. Effect of different parameters like extraction time (20, 40, and 60 min), power (80, 240, and 400 Watts) and solid part...
متن کاملA Convolutional Attention Network for Extreme Summarization of Source Code
Attention mechanisms in neural networks have proved useful for problems in which the input and output do not have fixed dimension. Often there exist features that are locally translation invariant and would be valuable for directing the model’s attention, but previous attentional architectures are not constructed to learn such features specifically. We introduce an attentional neural network th...
متن کاملEstimation of dosimetric parameters of I-125 brachytherapy source model 6711 using GATE8.1 code
Brachytherapy is one type of internal radiation therapy where radiation sources, which are usually encapsulated are placed as close as possible to the tumor site inside the patient's body. In this technique, it is important to determine dose distribution around the brachytherapy capsule. Hereby, in this paper, dosimetric parameters of I-125 brachytherapy source model 6711 are estimated accordin...
متن کاملDevelopement a simple point source model for Elekta SL-25 linear accelerator using MCNP4C Monte Carlo code
Background: Monte Carlo (MC) modeling of a linear accelerator is a prerequisite for Monte Carlo dose calculations in external beam radiotherapy. In this study, a simple and efficient model was developed for Elekta SL-25 linear accelerator using MCNP4C Monte Carlo code Materials and methods: The head of Elekta SL-25 linac was simulated for 6 and 18 MV photon beams using MCNP4C MC code. Energ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016